CPG-Actor: Reinforcement Learning for Central Pattern Generators

Abstract

Central Pattern Generators (CPGs) have several properties desirable for locomotion: they generate smooth trajectories, are robust to perturbations and are simple to implement. However, they are notoriously difficult to tune and commonly operate in an open-loop manner. This paper proposes a new methodology that allows tuning CPG controllers through gradient-based optimisation in a Reinforcement Learning (RL) setting. In particular, we show how CPGs can directly be integrated as the Actor in an Actor-Critic formulation. Additionally, we demonstrate how this change permits us to integrate highly non-linear feedback from sensory perception to reshape the oscillators' dynamics. Our results on a locomotion task using a single-leg hopper demonstrate that explicitly using the CPG as the Actor, rather than as part of the environment, results in a significant increase in the reward gained over time (20\(\times\) more) compared with previous approaches. Finally, we demonstrate how our closed-loop CPG progressively improves the hopping behaviour for longer training epochs relying only on basic reward functions.
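To make the "smooth trajectories" and "robust to perturbations" properties concrete, the following is a minimal sketch of a single open-loop oscillator. The abstract does not name the oscillator model, so the sketch assumes a Hopf oscillator, a common CPG building block; the function names and the parameters `mu`, `omega`, `alpha` and `dt` are hypothetical and chosen only for illustration. It does not include the RL or feedback components the paper proposes.

```python
import math


def hopf_step(x, y, mu=1.0, omega=2.0 * math.pi, alpha=10.0, dt=0.001):
    """One explicit-Euler step of a Hopf oscillator (illustrative helper).

    mu    : squared radius of the limit cycle (assumed parameter)
    omega : angular frequency in rad/s (assumed parameter)
    alpha : convergence gain towards the limit cycle (assumed parameter)
    """
    r2 = x * x + y * y
    dx = alpha * (mu - r2) * x - omega * y
    dy = alpha * (mu - r2) * y + omega * x
    return x + dt * dx, y + dt * dy


def simulate(x0, y0, steps=5000, **params):
    """Integrate the oscillator from (x0, y0) and return the trajectory."""
    x, y = x0, y0
    trajectory = []
    for _ in range(steps):
        x, y = hopf_step(x, y, **params)
        trajectory.append((x, y))
    return trajectory


# Two very different initial states stand in for perturbations:
# one well inside the limit cycle, one well outside it.
traj_small = simulate(0.1, 0.0)
traj_large = simulate(2.0, 0.0)

final_radius_small = math.hypot(*traj_small[-1])
final_radius_large = math.hypot(*traj_large[-1])
```

Under these assumptions both runs settle onto the same cycle of radius close to `sqrt(mu)`, which is the sense in which such an oscillator recovers from perturbations. The tuning difficulty mentioned in the abstract corresponds to choosing parameters such as `mu` and `omega` by hand.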

Similar resources

Central pattern generators

What are they? Central pattern generators (CPGs) are relatively small, relatively autonomous groups of neurons (neural networks) that produce patterned, rhythmic neural outputs that drive rhythmic behaviours. In addition to generating boring behaviours like walking, CPGs are also responsible for dancing, chewing, swallowing, suckling, copulation and orgasm — all the things that make life worthw...

Reinforcement Learning for CPG-Driven Biped Robot

Animals' rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...

Distributed Online Learning of Central Pattern Generators in Modular Robots

In this paper we study distributed online learning of locomotion gaits for modular robots. The learning is based on a stochastic approximation method, SPSA, which optimizes the parameters of coupled oscillators used to generate periodic actuation patterns. The strategy is implemented in a distributed fashion, based on a globally shared reward signal, but otherwise utilizing local communication ...

Central pattern generators for bipedal locomotion.

Golubitsky, Stewart, Buono and Collins proposed two models for the architecture of central pattern generators (CPGs): one for bipeds (which we call leg) and one for quadrupeds (which we call quad). In this paper we use symmetry techniques to classify the possible spatiotemporal symmetries of periodic solutions that can exist in leg (there are 10 nontrivial types) and we explore the possibility t...

Commentary/Selverston: Central pattern generators

and ion channels must be functionally described in order to obtain a full understanding of a CPG, we will not have a detailed, mechanistic explanation for some considerable length of time. A complete compilation of the detailed molecular biophysics of neurons will long remain the "quark" of cellular and integrative neurobiology. The most disturbing feature of the Selverston paper is its pessimi...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-89177-0_3